119 research outputs found

    Annotation of mathematical formulas in PDF documents

    Get PDF
    This article provides an overview of existing solutions for semantic analysis of mathematical documents, and also presents a method for automatic semantic analysis of documents in PDF format. This method searches for local variables in the text of the article, extracts their definitions and connects concepts with formulas. The advantage of the method over the existing ones is independence from the markup of theoriginal PDF document, which expands the scope of the method. We provide estimates of recall, precision and Fmeasure for algorithms for finding variables and linking local variables with formulas. The resulting semantic markup of the document will be used to create a collection of documents suitable for the semantic formula search service, which is part of the set of services of the Lobachevskii-DML digitalpublishing system

    OntoMathPROOntoMath^{PRO} Ontology: A Linked Data Hub for Mathematics

    Full text link
    In this paper, we present an ontology of mathematical knowledge concepts that covers a wide range of the fields of mathematics and introduces a balanced representation between comprehensive and sensible models. We demonstrate the applications of this representation in information extraction, semantic search, and education. We argue that the ontology can be a core of future integration of math-aware data sets in the Web of Data and, therefore, provide mappings onto relevant datasets, such as DBpedia and ScienceWISE.Comment: 15 pages, 6 images, 1 table, Knowledge Engineering and the Semantic Web - 5th International Conferenc

    Problem of transitivity of wikipedia category system

    Get PDF
    This paper analyses a violation of the transitivity principle of Wikipedia category system. Causes of the violation have been analyzed on base of ontological modeling methodologies such as Onto-Clean. A new approach for elimination of the violation has been proposed

    LoTA, a system for the analytical processing of technical texts: main concepts and design decisions

    Get PDF
    A variety of textual technical documents are elaborated and used in the process of system design of the onboard algorithmic and information display support for complex anthropocentric objects. One of these documents is titled "The Logic of Operation of the Crew-Onboard Equipment-Object System". It is vitally important to carry out computer analysis of these texts in order to ascertain whether or not they contain enough information to specify the main algorithms of functioning of the objects. This paper discusses the architecture of, and the linguistic support for, the computer system named "Logic/Text-Analysis" (LoTA), which is designed to analyze the completeness of the information content of Russian textual technical documents. Copyright Β©2007 by MAIK "Naitla/lnterperiodica"

    Mathematical text collections: Annotation and application for search tasks

    Get PDF
    This paper analyzes two models: semantic annotation of mathematical texts and semantic searching for mathematical texts in a marked-up collection. It also presents the results of a series of experiments that were performed with a semantically annotated collection of scientific publications in the field of mathematics. Β© 2013 Allerton Press, Inc

    Towards building Wordnet for the tatar language: A semantic model of the verb system

    Get PDF
    Β© Springer International Publishing Switzerland 2014. Wordnet is a lexical database where nouns, verbs, adjectives, and adverbs are organized in a conceptual hierarchy linking semantically and lexically related concepts to each other. This paper reports on the prototype of the Tatar Wordnet which currently contains about 5,500 Tatar verbs. Within our project we are creating a model of the semantic system of Tatar verbs as a hierarchical structure considering specifics of the Tatar language. For this purpose we use the entries of available Tatar dictionaries (explanatory dictionaries and those of synonyms). As the first step the extraction of available verbal synonyms from the dictionary of synonyms of the Tatar language was carried out. Then the most frequent 5156 Tatar verbs were selected and classified into several groups (synsets) according to their dominant semantic components with the purpose of adding new synsets and enriching those already existing (currently about 1,500 core synsets were distinguished). Then semantic relations between synsets were mapped (the verbs were linked according to their troponymy, entailment, and causality). The paper presents the results obtained, and discusses some problems encountered along the way

    RuThes cloud: Towards a multilevel linguistic linked open data resource for Russian

    Get PDF
    Β© 2017, Springer International Publishing AG. In this paper we present a new multi-level Linguistic Linked Open Data resource for Russian. It covers four linguistic levels: semantic, lexical, morphological and syntactic. The resource has been constructed on base of the well-known RuThes thesaurus and the original hitherto unpublished Extended Zaliznyak grammatical dictionary. The resource is represented in terms of SKOS, Lemon, and LexInfo ontologies and a new custom ontology. Building the resource, we automatically completed the following tasks: merging source resources upon common lexical entries, decomposing complex lexical entries, and publishing constructed resource as LLOD-compatible dataset. We demonstrate the use case in which the developed resource is exploited in IR task. We hope that our work can serve as a crystallization point of the LLOD cloud in Russian

    Optimizing DNA visualization with a solver P47H atomic-force microscope

    Get PDF
    The conditions for visualizing DNA molecules with a Solver P47H atomic-force microscope (NT-MTD, Moscow, Russia) were optimized. The DNA samples had different sizes, types, and conformations (pBR-322 plasmid DNA and chicken erythrocyte DNA) and were immobilized on mica. The microscope was equipped with a Smena-B detecting head and was operated in a tapping mode. The dependence of the amplitude of tip oscillations on the spacing between the tip and the test sample's surface was used to determine the optimum parameters of scanning. The highest quality and reproducibility of the DNA images were attained by scanning with a small initial amplitude (9-23 nm) of cantilever oscillations and an optimum gain (0.08-0.3). Images with the highest contrast were obtained in the amplitude curve's region corresponding to a repulsive interaction regime. The operating amplitude was set at one-half (or slightly less than) the initial amplitude of tip oscillations. Β© 2005 Pleiades Publishing, Inc

    Methods of automated design of application ontology

    Get PDF
    The control of completeness and information integrity of design specifications is an important problem of designing complex engineering systems. Computer aided design of textual technical documentation (technical documentation in natural language) is a complex problem. Its solution can be expected if natural (or given) limitations are imposed on the structure of the analyzed texts and an elaborated model of the application domain has been developed. In this paper, using the example of AviaOntology, the technological aspects of automatic design of applied ontologies that describe the application domain of the functioning of a complex technical system in various regimes of its operation are discussed. The problems of the use of the developed ontologies in problems of testing the information integrity of natural-language documents are considered

    Mathematical knowledge management: Ontological models and digital technology

    Get PDF
    This paper is discussed basic ideas, approaches and the results obtained in the research project the objective of which is to develop mathematical knowledge management technologies based on ontologies. We are developing the digital ecosystem OntoMath for mathematical knowledge management, which includes a set of specialized ontologies, text analytics tools and applications for managing mathematical knowledge. The results obtained are close to main problems declared in the World Digital Math Library (WDML) project. The main purpose of WDML is to build a global system of linked repositories for saving all digital mathematical documents, including contemporary and historic sources. This paper is devoted to decisions of some problems in this global initiative. In particular, we developed the program services for processing large collections of mathematical papers
    • …
    corecore